Model-Based Reinforcement Learning Method for Microgrid Optimization Scheduling
نویسندگان
چکیده
Due to the uncertainty and randomness of clean energy, microgrid operation is often prone instability, which requires implementation a robust adaptive optimization scheduling method. In this paper, model-based reinforcement learning algorithm applied optimal problem microgrids. During training process, current learned networks are used assist Monte Carlo Tree Search (MCTS) in completing game history accumulation, updating network parameters obtain strategies simulated environmental dynamics model. We establish environment simulator that includes Heating Ventilation Air Conditioning (HVAC) systems, Photovoltaic (PV) Energy Storage (ES) systems for simulation. The simulation results show microgrids both islanded connected modes does not affect effectiveness algorithm. After 200 steps, can avoid punishment exceeding red line bus voltage, after 800 result converges loss values value reward converge 0, showing good effectiveness. This proves proposed paper be
منابع مشابه
Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملAn integrated approach for scheduling flexible job-shop using teaching–learning-based optimization method
In this paper, teaching–learning-based optimization (TLBO) is proposed to solve flexible job shop scheduling problem (FJSP) based on the integrated approach with an objective to minimize makespan. An FJSP is an extension of basic job-shop scheduling problem. There are two sub problems in FJSP. They are routing problem and sequencing problem. If both the sub problems are solved simultaneously, t...
متن کاملModel-Free Trajectory Optimization for Reinforcement Learning
Many of the recent Trajectory Optimization algorithms alternate between local approximation of the dynamics and conservative policy update. However, linearly approximating the dynamics in order to derive the new policy can bias the update and prevent convergence to the optimal policy. In this article, we propose a new model-free algorithm that backpropagates a local quadratic time-dependent Q-F...
متن کاملReinforcement Learning: Model-based
Reinforcement learning (RL) refers to a wide range of dierent learning algorithms for improving a behavioral policy on the basis of numerical reward signals that serve as feedback. In its basic form, reinforcement learning bears striking resemblance to ‘operant conditioning’ in psychology and animal learning: actions that are rewarded tend to occur more frequently; actions that are punished ar...
متن کاملModel Based Reinforcement Learning with Final Time Horizon Optimization
We present one of the first algorithms on model based reinforcement learning and trajectory optimization with free final time horizon. Grounded on the optimal control theory and Dynamic Programming, we derive a set of backward differential equations that propagate the value function and provide the optimal control policy and the optimal time horizon. The resulting policy generalizes previous re...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Sustainability
سال: 2023
ISSN: ['2071-1050']
DOI: https://doi.org/10.3390/su15129235